Discrete multichannel audio with a backward compatible mix

ABSTRACT

A multichannel audio format provides a truly discrete as well as a backward compatible mix for surround-sound, front or other discrete audio channels in cinema, home theater, or music environments. The additional discrete audio signals are mixed with the existing discrete audio channels into a predetermined format such as the 5.1 audio format. In addition these additional discrete audio channels are encoded and appended to the predetermined format as extension bits in the bitstream. The existing base of multichannel decoders can be used in combination with a mix decoder to reproduce truly discrete N.1 multichannel audio.

PRIORITY INFORMATION

This application is a divisional application of Ser. No. 09/568,355filed on May 10, 2000 and claims priority of the same.

BACKGROUND OF THE INVENTION

1. Field of the Invention

This invention relates to multichannel audio and more specifically to amultichannel audio format that provides a truly discrete as well as abackward compatible mix for surround-sound, front or other discreteaudio channels in cinema, home theater, or music environments.

2. Description of the Related Art

Multichannel audio has become the standard for cinema and home theater,and is gaining rapid acceptance in music, automotive, computers, gamingand other audio applications. Multichannel audio provides asurround-sound environment that greatly enhances the listeningexperience and the overall presentation of any audio-visual system. Theearliest multichannel systems included left, right, center and surround(L, R, C, S) channels. The current standard in consumer applications is5.1 channel audio, which splits the surround channel into left and rightsurround channels and adds a subwoofer channel (L, R, C, Ls, Rs, Sub).

The move from stereo to multichannel audio has been driven by a numberof factors paramount among them being the consumers' desire for higherquality audio presentation. Higher quality means not only more channelsbut higher fidelity channels and improved separation or “discreteness”between the channels. In a truly discrete environment, discrete channelscarry discrete audio signals to discrete speakers.

To satisfy this demand, the audio industry had to provide a multichannelmix from the studio or content provider, multichannel encoding/decodingtechniques, a media capable of supporting multichannel audio andmultichannel speaker configurations. By its very nature, multichannelaudio includes significantly more data than stereo audio, which has tobe compressed to fit in the existing formats and on the existing media.With the advent of media such as DVD, new formats such as 5.1 have beendeveloped specifically for multichannel audio to enhance the listeningexperience.

The extension of multichannel audio beyond the 5.1 standard has onceagain raised the challenge of developing new encoding/decodingtechniques that move the state-of-the-art forward while maintainingbackward compatibility with the 5.1 standard. Having become accustomedto discrete audio, the consumer will demand the same performance as morechannels are added. Backward compatibility is critical because of thegreat investment in 5.1 equipment by consumers and professionals alike.

Dolby Prologic™ provided one of the earliest multichannel systems.Prologic squeezes 4-channels (L, R, C, S) into 2-channels (Lt, Rt) byintroducing a phase-shifted surround sound term. These 2-channels arethen encoded into the existing 2-channel formats. Decoding is a two stepprocess in which an existing decoder receives Lt, Rt and then a Prologicdecoder expands Lt, Rt into L, R, C, S. Because four signals (unknowns)are carried on only two channels (equations), the Prologic decodingoperation is only an approximation and cannot provide true discretemultichannel audio. As shown in FIG. 1, a studio 10 will mix several,e.g. 48, audio sources to provide a four-channel mix (L, R, C, S). (Thismix may be monitored through a matrix encode and decode process.) ThePrologic encoder 12 matrix encodes this mix as follows:Lt=L+0.707C+S(+90°), and   (1)Rt=R+0.707C+S(−90),   (2)which are carried on the two discrete channels, encoded into theexisting two-channel format and recorded on a media 14 such as film.

A matrix decoder 16 decodes the two discrete channels Lt, Rt and expandsthem into four discrete reconstructed channels Lr, Rr, Cr and Sr. Apassive matrix decoder decodes the audio data as follows:Lr=Lt,Rr=Rt,Cr=(Lt+Rt)/2, andSr=(Lt−Rt)/2.In general, the Lr and Rr channels have significant center and surroundcomponents and Cr and Sr have left and right components. The reproducedaudio signals, although carried on discrete channels to discretespeakers in a speaker configuration 18, are not discrete, but in factare characterized by significant crosstalk and phase distortion. Forthis reason passive decoders are rarely used.

Active matrix decoders reduce crosstalk and phase distortion but at bestapproximate a discrete audio presentation. Many different proprietaryalgorithms are used to perform an active decode and all are based onmeasuring the power of Lt+Rt, Lt−Rt, Lt and Rt to calculate gain factorsGi whereby,Lr=G1*Lt+G2*RtRr=G3*Lt+G4*RtCr=G5*Lt+G6*Rt, andSr=G7*Lt+G8*Rt.Active decode provides better compensation based on the power of thesignal but crosstalk among components remains and true discretereproduction is not possible.

The advent of the 5.1 format represented a fundamental shift inmultichannel audio away from squeezing multiple channels into anexisting stereo format and the phase distortion and crosstalk associatedwith matrix coding and to a truly discrete multichannel format, whichprovides higher fidelity and improved separation and directionality.Furthermore, two additional channels were added. The subwoofer(“Sub”)(0.1 channel) provides enhanced low frequency capability. Thesurround channel S consists of left Ls and right Rs channels indicatingthe consumers' strong preference for true discrete sound even in thesurround channels. Each signal (L, C, R, Ls, Rs, Sub) is compressedindependently and then mixed together in a 5.1 format therebymaintaining the discreteness of each signal. Dolby AC-3™, Sony SDDS™ andDTS Coherent Acoustics™ are all examples of 5.1 systems.

As shown in FIG. 2, the studio 20 provides a 5.1 channel mix. A 5.1encoder 22 compresses each signal or channel independently, multiplexesthem together and packs the audio data into a given 5.1 format, which isrecorded on a suitable media 24 such as a DVD. A 5.1 decoder 26 decodesthe bitstream a frame at a time by extracting the audio data,demultiplexing it into the 5.1 channels and then decompressing eachchannel to reproduce the signals (Lr, Rr, Cr, Lsr, Rsr, Sub). These 5.1discrete channels, which carry the 5.1 discrete audio signals aredirected to the appropriate discrete speakers in speaker configuration28 (subwoofer not shown).

In its cinema products, DTS implemented its 5.1 system with 5 singlechannel APT-X encoders by taking advantage of the spectralcharacteristics of the surround and subwoofer channels withoutsacrificing performance. The use of five rather than six processorsreduced system cost As shown in FIG. 3, the 5.1 signal is reformattedinto a 5 channel signal with a mixer 32 that mixes the Ls, Sub and Rsinto two channels using standard studio mixing techniques, i.e. the subis reduced by 3 dB and added to the L and R surround channels. Morespecifically, the left and right surround channels Ls, Rs are high passfiltered, the subwoofer channel Sub is low pass filtered, and then mixedtogether. The Sub channel carries low frequencies and has a bandwidthless than 150 Hz and the Ls and Rs signals have only minimal lowfrequency content. An APT-X decoder 34 decodes the five channels andpasses Lts and Rts to a demixer 36, which high pass filters them toreproduce Lrs and Rrs, and low pass filters and sums them to reproducethe subwoofer channel Sub.

Extension to discrete 6.1 and higher multichannel formats is limited byspace availability on the media, reliability and the strong desire tomaintain backward compatibility with existing 5.1 decoders. Multichannelaudio consumes a lot of space on the medium. Providers want to extendplaytime, include multiple different audio formats including 2-channelPCM, Dolby AC-3 and DTS Coherent Acoustics, add other content such asdirector's comments, outtakes, etc.

Dolby has developed Dolby EX, as described in PCT PublicationW099/57941, which provides more than two surround-sound channels in thecurrent 5.1 formats and does so without increasing space requirements(number of bits or film space). Dolby EX provides more than two surroundsound channels within the format of a digital soundtrack system designedto provide only two surround sound channels. Three main channels arerecorded in the discrete soundtrack channels and 3, 4 or 5surround-sound channels are matrix-encoded and recorded in two discretesurround-sound soundtrack channels. The digital audio stream of thedigital soundtrack system designed to provide only two surround soundchannels remains unaltered, thus providing compatibility with existingplayback equipment. Moreover, the format of the media carrying thedigital sound tracks is unaltered. Dolby asserts that the “discreteness”of the digital soundtrack system is not audibly diminished by employingmatrix technology to surround sound channels, particularly if activematrix decoding is employed.

Dolby EX introduces phase-shifted surround sound terms to matrix encodethe 3, 4 or 5 surround-sound signals into two channels, whichfacilitates decoding the two channels into 3, 4 or 5 audio channels. Theintroduction of the phase-shifted terms is essential to Dolby EX as itwas to Dolby Prologic. The encoding process is given by the followinggeneralized equations:Lts=Ls+ΣGi*Si(φi) for i=0, 1, 2, andRts=Rs+ΣHi*Si(φi) for i=0, 1, 2where Gi and Hi are the gain coefficients, Si are the additionalsurround-sound channels and φi are the phase distortion components. Thedecoding process is given by the following generalized equations:Lrs=G1*Lts+G2*RtsRrs=G3*Lts+G4*RtsCrs=G5*Lts+G6*RtsIn the special case of three surround-sound channels (Ls, Rs, Cs), thesegeneralized equations default to the well known mix equations where theCs channel is reduced by 3 dB and added to the Ls and Rs channels asfollows:Lts=Ls+0.707Cs, andRts=Rs+0.707Cs.It is believed that actual Dolby Ex systems phase shift Ls and Rs byplus and minus 45 degrees, respectively, to provide more depth to thesurround sound. The QS or SQ matrix systems cited in the PCT Publicationteach that technique.

As shown in FIG. 4, in a Dolby Ex system 40 the studio 42 provides a 6.1channel mix (L, R, C, Ls, Rs, Cs, Sub) where Cs is an additional centersurround channel. A matrix encoder 44 applies the Prologic codingalgorithm to the three surround sound channels (Ls, Cs, Rs) to matrixencode them into Lts and Rts. The 5.1 channels L, R, C, sub, Lts, Rtsare encoded using an AC-3, Sony or DTS encoder 46 and recorded onto amedia 48. A 5.1 decoder 50 decodes the audio data to reproduce thediscrete L, R, C and Sub audio channels and pass the matrix encoded Ltsand Rts channels to a matrix decoder 52, which matrix decodes thechannels into Lrs, Crs and Rrs using the same active matrix techniquesas the Pro Logic decoders. The 6.1 discrete channels are directed todiscrete speakers 54 for audio playback.

It is important to note that the three discrete surround channels do NOTcarry discrete signals. The same crosstalk and phase distortionlimitations associated with Prologic are now reintroduced into what wasa truly discrete multichannel system. While it is true that a listener'ssensitivity to position and direction is less for rear signals, truediscrete audio reproduction will provide better sound separation anddirectionality. For the same reasons consumers preferred 2-channelsurround over mono surround they will prefer 3-channel discrete surroundover matrixed 2 channel surround.

Dolby EX represents a first step toward enhanced multichannel audio.Dolby EX provides additional surround sound channels using existing 5.1formats without increasing the bit rate. Furthermore, Dolby EX preservesthe discrete coding of L, R, C and sub audio signals. However, Dolby EXachieves these desirable results by sacrificing the true discreteness ofthe surround sound channels. A 3:2:3 system will suffer the samecrosstalk limitation as Pro Logic. 4:2:4 and greater systems will alsosuffer phase distortion problems due to the matrix decode.

Dolby cannot provide true discrete N.1 audio because audio qualityand/or reliability will suffer. The PCT Publication contemplates andthen dismisses a new N.1 format for truly discrete audio stating“Although, in theory, additional channels could be carried by reducingthe symbol size in order to provide more bits and allowing the storageof more data in the same physical area, such a reduction would introduceunwanted difficulties in the printing process and require substantialmodification or recorder and player units in the field.” A true N.1format would be incompatible with existing hardware and would require atleast substantial modification if not total replacement.

Accordingly, there remains an unfulfilled need in the industry toprovide a truly discrete multichannel surround sound environment withmore than two surround channels while maintaining backward compatibilitywith existing 5.1 decoders without sacrificing audio quality orreliability.

SUMMARY OF THE INVENTION

In view of the above problems, the present invention provides a trulydiscrete multichannel audio environment with additional discrete audiosignals while maintaining backward compatibility with existing decoders.

A truly discrete as well as a backward compatible mix forsurround-sound, front or other discrete audio channels for cinema, hometheater, or music by mixing additional discrete audio signals with theexisting discrete audio channels into a predetermined format such as the5.1 audio format. These additional discrete audio channels areseparately encoded and appended to the predetermined format as extensionbits in the bitstream.

In a 5.1 channel environment, the more than two discrete surround-soundaudio signals (Ls, Rs, Cs, . . . ) are mixed into two discretesurround-sound channels (Lts, Rts). The front channels (L, R, C, sub)and the mixed surround-sound channels (Lts, Rts) are encoded using astandard 5.1 encoder. The additional discrete surround-sound audiosignals (Cs, . . . ) are independently encoded and carried in a discreteextension surround-sound channel that is appended to the 5.1 bitstreamas extension bits. The bitstream is compatible with a variety of decoderconfigurations including existing 5.1 decoders, a 5.1 decoder plusexisting matrix decoders, a 5.1 decoder plus a mix decoder and a N.1decoder. The inclusion of the additional discrete surround-sound audiosignals in the bitstream makes possible the reproduction of truediscrete multichannel audio when used with either the 5.1 decoder plusthe mix decoder or the N.1 decoder.

A 5.1 decoder reads the 5.1 bitstream and ignores the extension bits.The 5.1 decoder decodes the Lts and Rts surround-sound channels anddirects the mixed audio signals to the discrete left and rightsurround-sound speakers. Playback creates the discrete left and rightsurround-sound signals and a “phantom” surround-sound signal from thecenter surround (Cs) audio signal and any other additional surroundsignals that acoustically appears at the center of the left and rightsurround speaks. The phantom surround is completely devoid of any phasedistortion.

The inclusion of a matrix decoder with the 5.1 decoder decodes the Ltsand Rts channels into Lrs, Rrs and Crs matrixed audio signals, which arecarried on discrete channels to left, right and center surroundspeakers. The Lrs, Rrs and Crs audio signals are not discrete andexhibit the crosstalk associated with matrix coding.

The inclusion of a mix decoder with the 5.1 decoder reads the extensionbits and decodes the additional surround-sound audio signals (Crs, . . .). The mix decoder subtracts the weighted surround sound audio signals(Crs, . . . ) from the left and right total surround-sound signals(Lrts, Rrts) to produce truly discrete surround-sound audio signals(Lrs, Rrs, Crs, . . . ), which are carried on discrete channels todiscrete speakers. A true N.1 decoder incorporates the 5.1 decoder andmix decoder in a single box. Playback creates a truly discrete (discretesignals carried on discrete channels to discrete speakers)surround-sound environment in which the surround-sound portion exhibitsimproved sound separation and directionality. Unlike matrix-encodedsurround-sound audio, the mix-encoded N.1 channel audio providesdiscrete playback without crosstalk.

These and other features and advantages of the invention will beapparent to those skilled in the art from the following detaileddescription of preferred embodiments, taken together with theaccompanying drawings, in which:

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1, as described above, is a block diagram and schematic plan viewof a known Dolby Prologic surround-sound system and a theater showingidealized loudspeaker locations for reproducing left (L), center (C),right (R) and surround (S) motion picture soundtrack channels;

FIG. 2, as described above, is a block diagram and schematic plan viewof a known 5.1 surround-sound system and a theater showing idealizedloudspeaker locations for reproducing left (L), center (C), right (R),sub and surround (S) motion picture soundtrack channels;

FIG. 3, as described above, is a block diagram of a known DTS 5.1surround-sound system that uses a 5-channel APT-X encoder;

FIG. 4, as described above, is a block diagram and schematic plan viewof a known Dolby EX surround-sound system and a theater showingidealized loudspeaker locations for reproducing left (L), center (C),right (R), left surround (Ls), right surround (Rs) and center surround(Cs) motion picture soundtrack channels;

FIG. 5 is a block diagram of a surround-sound encoder in accordance withthe present invention for providing discrete N.1 channel audio that isbackward compatible with 5.1 channel audio;

FIG. 6 is a schematic illustration of a N.1 channel bitstream inaccordance with the present invention;

FIG. 7 is a block diagram and schematic plan view of a known 5.1 decoderwith a loudspeaker arrangement for reproducing left (L), center (C),right (R), left surround (Ls), right surround (Rs) and “phantom” centersurround (Cs) audio channels based on a 3:2 mix in accordance with thepresent invention;

FIG. 8 is a block diagram and schematic plan view of a 5.1 decoder andmatrix decoder with a loudspeaker arrangement for reproducing left (L),center (C), right (R), left surround (Ls), right surround (Rs) andcenter surround (Cs) audio channels;

FIG. 9 is a block diagram and schematic plan view of a 5.1 decoder witha mix decoder with a loudspeaker arrangement for reproducing left (L),center (C), right (R), left surround (Ls), right surround (Rs) andcenter surround (Cs) audio channels in accordance with the presentinvention;

FIG. 10 is a block diagram and schematic plan view of a 6.1 decoder witha loudspeaker arrangement for reproducing left (L), center (C), right(R), left surround (Ls), right surround (Rs) and center surround (Cs)audio channels;

FIG. 11 is a schematic diagram of the mix decoder shown in FIG. 9 andincorporated in the 6.1 decoder shown in FIG. 10;

FIG. 12 is a block diagram of an alternate embodiment for the N.1channel encoder, which provides enhanced mixing capability but requiresboth a 5.1 and N.1 mix from the studio and additional extension bits;and

FIG. 13 is a block diagram of a multichannel audio encoder for providinga truly discrete as well as a backward compatible mix forsurround-sound, front or other discrete channels.

DETAILED DESCRIPTION OF THE INVENTION

The present invention provides a multichannel audio format for a trulydiscrete as well as a backward compatible mix for surround-sound, frontor other discrete audio channels in cinema, home theater, or musicenvironments. The additional discrete audio signals are mixed with theexisting discrete audio channels into a predetermined format such as the5.1 audio format. In addition these additional discrete audio channelsare encoded and appended to the predetermined format as extension bitsin the bitstream. The existing base of multichannel decoders can be usedin combination with a mix decoder to reproduce truly discrete N.1multichannel audio. This allows a consumer or professional to choosewhether to keep their existing audio systems and realize some of thebenefits of additional surround-sound channels or to upgrade theirsystems by adding a mix decoder to realize truly discrete multichannelaudio for the ultimate listening experience.

It is to be understood that the present approach is applicable to extendany predetermined multichannel audio format, of which 5.1 is the currentstandard, to greater number of channels of discrete audio whilemaintaining backward compatibility to the predetermined format. Forexample, a true 10.2 format may be adopted for certain very specializedaudio systems. At some point after the adoption of such a 10.2 format itmay be desirable to extend that format to even more channels. Forpurposes of clarity, the present invention will be described withreference to a 5.1 channel system without lack of generality.

For purposes of clarity, the present invention will now be describedwith reference to the drawings in the context of a 5.1 channel system.FIG. 5 is a block diagram of a N.1 channel surround-sound encoder 100 inaccordance with the present invention. A studio 110 provides an N.1channel mix of which the L, R, C and Sub channels are passed directly toa 5.1 encoder 112 such as DTS Coherent Acoustics, Dolby AC-3 or SonySDDS. The Ls, Rs, Cs and any other additional surround-sound channelsare first passed to a mix encoder 114 that mixes the three or morechannels into Lts and Rts channels, which are then passed to 5.1 encoder112. 5.1 encoder 112 encodes the 5.1 channels and channel encoders 116a, 116 b, . . . encode the additional surround-sound channels,respectively. The channel encoders may use the same 5.1 encoderdefaulted to encode a single channel or other single channel encoders. Aframe formatter 118 appends the extension bits 120 a, 120 b, . . . foreach of the surround-sound channels to the 5.1 format bits 122 a frameat a time in bitstream 124 as shown in FIG. 6. Bitstream 124 is recordedon a media 126 such as a DVD, CD, DVT, or film in a digital format. Withfilm optically recorded symbols represent the digital information, thedigital information, in turn, represents discrete audio channels.Optical discs such as CDs or DVDs have pits impressed in the discsurface that represent digital information, the digital information, inturn, represents said discrete audio channels. Alternately, bitstream124 could be encoded on a carried signal and broadcast to consumers.Backward compatibility is maintained because existing decoders read onlythe 5.1 bits and ignore the extension bits. True discrete multichannelaudio is achieved with a new mix decoder that reads both the 5.1 andextension bits.

The inclusion of the additional surround-sound audio signals in both thetwo-channel mix and discrete channels eliminates the need to introduce aphase-shift in order to decode the three or more audio channels. Assuch, mix encoder 114 has more flexibility to mix the surround-soundchannels. For example, a coherent mix introduces no phase-shifts ordelays. This has the advantage that neither a direct 5.1 decode thatproduces a “phantom” surround channel or a 2:3 matrix-decode introducephase distortion. Alternately, mix encoder 114 could phase-shift the Lsand Rs signals to improve the depth of the matrix decoded surround-soundaudio. The key is that the phase term is not needed in order to decode,and that the inclusion of the additional channels in the bitstreamallows the mix decoder to reproduce discrete audio for either mixapproach.

Assuming a coherent mix, the generalized mixing equations are asfollows:Lts=Ls+ΣGiSi for i=0, 1, 2, . . .Rts=Rs+ΣHiSi for i=0, 1, 2, . . .where Gi and Hi are the gain coefficients and Si are the additionalsurround-sound channels.

In the special case of three surround-sound channels (Ls, Rs, Cs), thesegeneralized equations default to the well known mix equations where theCs channel is reduced by 3 dB and added to the Ls and Rs channels asfollows:Lts=Ls+0.707Cs, andRts=Rs+0.707Cs.

At this one point, a 3:2 mix of a center surround channel, thematrix-encode equations for the Dolby EX system and the mix-encodeequations of the present invention each default to the standardtechnique for mixing a center channel with left and right channels.Although the mix equations are identical at this one point, the systemof the present invention is fundamentally different than either Dolby EXor standard mixing practice. In those instances the additional signalsare only mixed into the left and right signals thereby sacrificing theability to reproduce discrete multichannel audio. The present inventiondetails a method for both producing discrete multichannel audio whilemaintaining backward compatibility. Unlike Dolby EX, this approachrequires additional bits (space) to encode the bitstream. However, asevidenced by the earlier adoption of left/right surround to replace monosurround, true discrete surround-sound audio will replace matrix-decodedsurround-sound audio.

The bitstream is compatible with a variety of decoder configurationsincluding existing 5.1 decoders, a 5.1 decoder plus existing matrixdecoders, a 5.1 decoder plus a mix decoder and a N.1 decoder. Mixing theadditional surround-sound signals with the left and right surroundsignal provides backward compatibility. The inclusion of the additionaldiscrete surround-sound audio signals in the bitstream makes possiblethe reproduction of true discrete multichannel audio when used witheither the 5.1 decoder plus the mix decoder of the N.1 decoder.

As shown in FIG. 7 a conventional 5.1 decoder 130 decodes bitstream 124a frame at a time by detecting the sync bit, reading 5.1 formatted bits122 and ignoring extension bits 120 a, 120 b, . . . Decoder 130 decodesthe 5.1 bits to reproduce left (Lr), center (Cr), right (Rr), subwoofer(Sub), left surround (Lrts), and right surround (Rrts) discrete audiochannels. The left, center, right and sub discrete channels, which carryrespective discrete audio signals, are directed to discrete speakers L,C, R and Sub (not shown) in a loudspeaker arrangement 132 for playback.The left and right surround channels, which carry a three-channel mix,are directed to discrete speakers Ls and Rs. This creates a “phantom”center surround (Crs) audio signal that appears acoustically between theLs and Rs speakers without the benefit of an actual speaker. Theposition of the phantom surround can be varied by adjusting the mix butis typically a center surround. Consumers with existing 5.1 decoders canchoose not to upgrade and still receive a compatible mix.

A conventional 5.1 decoder when used in a 3:2:3 system reproduces thesame multichannel audio experience for the encoding techniques describedin FIGS. 5 and 6 as it would with Dolby EX encoded audio data (providedthe Ls and Rs signals in Dolby EX are not phase shifted by 45 degrees).However, for N:2:N systems where N>3 or N=3 and the Ls and Rs signalsare phase shifted the audio experience is not the same. The encodingtechniques of the present invention will not exhibit the phasedistortion problems associated with Dolby EX.

As shown in FIG. 8, the basic playback configuration depicted in FIG. 7can be enhanced by the addition of a matrix decoder 134 and a centerchannel speaker Cs. Matrix decoder 134 matrix decodes the left and rightsurround-sound channels Lrts and Rrts into three discrete audio channelsRrs, Crs and Lrs that are directed to respective speakers Ls, Cs and Rsfor playback. Although the channels are discrete the signals they carryare not. The dematrixed audio signals exhibit the same crosstalk andphase distortion drawbacks as discussed above in connection with theDolby ProLogic system.

As discussed above in reference to FIG. 4, the Dolby EX system isdesigned for use with a 5.1 decoder and matrix decoder having this sameconfiguration. Again the 3:2:3 systems may be equivalent but the N:2:Nwill differ due to the phase-shift components in Dolby EX encoding. Inpractice even when N=3 there is a 45 degree phase shift applied to theLs and Rs signals.

As illustrated in FIGS. 7 and 8, the mix encoding techniques of thepresent invention maintain backward compatibility with 5.1 decoders andmatrix decoders. The audio performance is equivalent to Dolby EX for3:2:3 systems and improved when additional surround-sound channels areencoded.

The distinct advantage of the present encoding and formatting techniquesover Dolby EX, as illustrated in FIGS. 9-11, is the ability to reproducetruly discrete N.1 channel audio; discrete signals carried on discretechannels to discrete speakers. As evidenced by the industry's move frommatrix encoded/decoded multichannel audio to discrete 5.1 audio earlier,the consumer will prefer discrete N.1 channel audio over matrix-decodedN.1 channel audio.

As shown in FIG. 9, a 5.1 decoder 140 reads the 5.1 audio 122 frombitstream 124 and ignores the extension bits 120 a, 120 b, . . . ,decodes the L, C, R and Sub signals and passes them to respectivespeakers in a loudspeaker arrangement 142. Decoder 140 decodes the Ltsand Rts signals and passes them to a mix decoder 144, which ignores the5.1 audio bits and reads the extension bits. Mix decoder 144 decodeseach of these additional surround-signals and uses them to separate thethree or more surround-sound signals Lrs, Crs and Lrs from the Lts andRts, which are passed to discrete speakers Ls, Cs and Rs. As shown inFIG. 10, an N.1 decoder 145 incorporates the functions of the 5.1decoder and mix decoder in one box.

As shown in FIG. 11, mix decoder 144 includes a channel decoder 146 thatdecodes the additional surround-sound channel Crs from the extensionbits and directs it to the center surround-sound speaker Cs. Mix decoder144 weights Csr (148 a, 148 b), e.g. reduces it by 3 dB, and subtracts(150 a, 150 b) it from the Ltrs and Rtrs signals to remove all traces(except quantization noise) of the center surround-sound channel Csleaving only the discrete Lrs and Rrs signals, which are directed toleft and right surround-sound speakers Ls and Rs. More specifically thedecode equations for a 2:3 decoder are as follows:Lsr=Lts −0.707Csr, andRsr=Rts −0.707CsrThe circuit is easily expandable to accommodate more than threesurround-sound signals by using additional channel decoders, multipliersand summing nodes.

As shown in FIGS. 9-11, the incorporation of the additionalsurround-signals at mix decoder 144 provides the N equations for the Nunknowns in the mixed audio signal carried on Lts and Rts. As a result,other than quantization noise, the process of separating the audiosignals is exact, i.e. no crosstalk or phase distortion. Thereforeconsumers who upgrade by purchasing either a mix decoder for use withtheir 5.1 decoder or a new N.1 decoder receive all the benefits of atruly discrete (signal, channel & speaker) system and an N.1 bitstreamformat.

It is important to note that the audio quality obtained by mixing thethree or more surround-sound channels into a 5.1 format and appendingthe additional surround-sound signals as extension bits, and separatingthe audio signals as just described would be substantially the same asthe audio quality associated with a true N.1 format, which would not bebackward compatible with 5.1 systems. This slight advantage is easilyoutweighed by the necessity to provide backward compatibility.

Although the described audio mixing/separating techniques and modifiedbitstream format are generally applicable to all 5.1 formats includingDolby-AC3 and Sony SDDS they are particularly well suited for use withthe DTS Coherent Acoustics, which has the ability to vary frame size asis described in detail in U.S. Pat. No. 5,978,762. The variable framesize can be used to accommodate additional surround-sound channels, i.e.the extension bits by either a) reducing the frame size or b) adaptivelychanging the frame size. Dolby AC-3 has a fixed frame size withinsufficient bits to accommodate the extension bits without sacrificingfidelity of the reconstructed audio signals.

The DTS Coherent Acoustics encoder/decoder can vary its frame size byone bit at a time. DTS Coherent Acoustics has the flexibility to reduceframe size to increase the bit rate to accommodate N.1 systems andparticularly the extra extension bits. The reduction of frame sizeincreases the percentage of bits allocated to overhead and reduces theflexibility for bit allocation but allows true discrete N.1 channelaudio to be reproduced with sufficient sound quality.

An alternate embodiment for encoding N.1 channel audio (N=3 as depicted)is shown in FIG. 12. This approach provides enhanced mixing capabilitybut requires both a 5.1 and 6.1 mix from the studio and additionalextension bits. Studio 150 provides both a 5.1 mix 152 and a 6.1 mix 154of which only the Ls, Cs and Rs channels are used. The Lts and Rtschannels of the 5.1 mix have been mixed by the studio to include the Cschannel. The 5.1 mix is passed to a 5.1 encoder 156 that encodes themultichannel signal into a standard 5.1 audio format.

The Lts and Rts audio channels are weighted by coefficients C1 and C2and subtracted from the Ls and Rs audio channels from the 6.1 mix 154,respectively, to produce difference signals dLs and dRs. An encoder 158encodes Cs, dLs and dRs and passes them to a frame formatter 160 thatappends them as extension bits to the 5.1 audio format in the bitstream.Each additional channel added after 6.1 adds one new channel to theextension bits. This approach is not constrained by simple linearequations to mix the signals but requires two additional channels, dLsand dRs to encode the audio data.

To this point the invention has been described as a technique for mixingthree or more surround-sound channels into the left and rightsurround-sound channels. Although this is the current application forsuch techniques, the same techniques can be used to provide a trulydiscrete as well as a backward compatible mix for additional frontchannels, side channels, subwoofer or any other discrete channels.

As shown in FIG. 13, an N:M Mixer 170 mixes N discrete input signalsinto M channels that carry the N-channel mix. An encoder 172 encodes theM-channel audio signal into a predetermined format. Channel coders 174a, 174 b, . . . encode each of the L=N-M additional discrete audiosignals. A frame formatter 176 appends the encoded additional signals asextension bits to the predetermined format in a bitstream, which is thenrecorded on a media 178. This describes a general approach for extendinga predetermined multichannel audio format to a greater number ofdiscrete channels while maintaining backward compatibility with decodersdesigned for the predetermined format.

While several illustrative embodiments of the invention have been shownand described, numerous variations and alternate embodiments will occurto those skilled in the art. Such variations and alternate embodimentsare contemplated, and can be made without departing from the spirit andscope of the invention as defined in the appended claims.

1. A medium recorded with M discrete audio channels that carry N mixedaudio signals, and at least one discrete extension channel that carriesat least one of the N audio signals.
 2. The medium of claim 1, whereinthe M discrete audio channels are recorded on the medium in apredetermined multichannel audio format and the discrete extensionchannel is recorded on the medium as extension bits appended to saidpredetermined multichannel audio format.
 3. The medium of claim 2,wherein the medium is compatible for playback on an existing base ofaudio decoders configured to read media recorded with the predeterminedmultichannel audio format and on audio coders configured to read mediarecorded with the predetermined multichannel audio format and thediscrete extension channel.
 4. The medium of claim 2, wherein the Mdiscrete audio channels and at least one discrete extension channel arerecorded on the medium as a sequence of audio frames that form a digitalbitstream, each said audio frame including a sync word, audio data in apredetermined multichannel audio format representative of said discreteaudio channels, and audio data appended to said multichannel audioformat representative of said discrete extension channel.
 5. The mediumof claim 4, wherein the size of the audio frame may vary fromframe-to-frame to accommodate the discrete extension channels.
 6. Themedium of claim 1, wherein M equals N.
 7. The medium of claim 1, whereinN is greater than M.
 8. The medium of claim 7, wherein the medium isrecorded with N minus M discrete extension channels each carrying adifferent one of the N audio signals.
 9. The medium of claim 8, whereinthe M equal two discrete audio channels are surround-sound channels, themedium being further recorded with three discrete front channels thatcarry respective discrete front audio signals and a sub channel thatcarriers a subwoofer signal, said three front channels, twosurround-sound channels and sub channel being recorded on the medium ina 5.1 audio format and said discrete extension surround-sound channelbeing recorded as extension bits appended to said 5.1 audio format. 10.The medium of claim 8, wherein the medium is recorded with N minus twodiscrete extension channels each carrying a different one of the N audiosignals.
 11. The medium of claim 9, wherein the N audio signals comprisea left surround signal, a right surround signal and center surroundsignal.
 12. The medium of claim 1, wherein the N audio signals comprisesurround-sound audio signals.
 13. The medium of claim 1, wherein saiddiscrete audio channels are carried on said medium in a digital format.14. The medium of claim 13, wherein said medium is film having opticallyrecorded symbols representing digital information, the digitalinformation, in turn, representing said discrete audio channels.
 15. Themedium of claim 13, wherein said medium is an optical disc having pitsimpressed in the disc surface representing digital information, thedigital information, in turn, representing said discrete audio channels.16. The medium of claim 15, wherein said medium is a digital video disc(DVD).
 17. The medium of claim 1, wherein the media is recorded with Ndiscrete extension channels of which N minus M channels carry discreteaudio signals and the remaining M channels carry difference audiosignals.
 18. The medium of claim 17, wherein the M difference audiosignals represent a weighted difference between an M channel audio mixand the corresponding M channels in an N channel audio mix.
 19. Anarticle of manufacture, comprising: a portable machine readable storagemedium; and a digital bitstream representing a multichannel audio signalincluding N audio signals that are mixed onto M discrete audio channelswhere N>M with at least one of said N audio signals being carried on atleast one discrete extension channel, said discrete audio channels anddiscrete extension channel being recorded onto said portable machinereadable storage medium as a sequence of audio frames, each said audioframe comprising: audio data in a predetermined multichannel audioformat representative of said discrete audio channels, and audio dataappended to said multichannel audio format representative of saiddiscrete extension channel.
 20. The medium of claim 19, wherein the sizeof the audio frame may vary from frame-to-frame to accommodate thediscrete extension channels.
 21. The medium of claim 19, wherein saiddiscrete audio channels are carried on said medium in a digital format,said medium is an optical disc having pits impressed in the disc surfacerepresenting digital information, the digital information, in turn,representing said discrete audio channels.
 22. A multichannel audiosignal encoded with M discrete audio channels that carry N mixed audiosignals and a discrete extension surround-sound channel that carries oneof the audio signals.
 23. The signal of claim 22, wherein the M discreteaudio channels are encoded in a predetermined multichannel audio formatand the discrete extension channel is encoded and appended to saidpredetermined multichannel audio format.
 24. The signal of claim 23,wherein the signal is encoded with N minus M discrete extension channelseach carrying a different one of the N audio signals.
 25. The signal ofclaim 22, wherein said signal comprises a carrier signal that is encodedwith the M discrete audio channels and the discrete extension channel.26. The medium of claim 25, wherein the signal is encoded with Ndiscrete extension channels of which N minus M channels carry discreteaudio signals and the remaining M channels carry difference audiosignals.
 27. The medium of claim 26, wherein the M difference audiosignals represent a weighted difference between an M channel audio mixand the corresponding M channels in an N channel audio mix.
 28. A methodof recording multichannel audio on a media with an extendedmultichannnel audio format for reproduction by either an existing baseof audio decoders configured to read media recorded with a predeterminedmultichannel audio format or a base of audio decoders configured to readmedia recorded with the extended multichannnel audio format, comprising:mixing sound information for N audio signals; mixing the N audio signalsinto M audio signals where M is less than N; recording said M audiosignal in M discrete audio channels with the predetermined multichannelaudio format onto the media; and recording at least one of the audiosignals in respective discrete extension audio channels as extensionbits appended to the predetermined multichannel audio format on themedia, which together comprise the extended multichannnel audio format.29. The method of claim 28, wherein said discrete audio channels anddiscrete extension channel are recorded onto said media as a sequence ofaudio frames, further comprising varying the size of the audio framefrom frame-to-frame to accommodate the discrete extension channels. 30.The method of claim 28, where N minus M of the audio signals arerecorded in respective discrete extensions audio channels.
 31. Themethod of claim 28, wherein said audio signals comprise left, right andcenter surround-sound signals, sound information for three front and asub audio signal being mixed and recorded with the two discretesurround-sound channels in a 5.1 channel audio format.
 32. The method ofclaim 28, wherein the N and M audio signals represent N-channel andM-channel mixes, respectively, further comprising: computing weighteddifference signals between the M-channel mix and the corresponding Mdiscrete audio signals in the N-channel mix, and recording the weighteddifference signals on respective discrete extension channels.
 33. Amethod of reproducing multichannel audio, comprising: receiving a mediahaving a recorded bitstream, said bitstream including M discrete audiochannels that carry N mixed audio signals where N>M in a predeterminedmultichannel audio format and at least one discrete extension channel,each one carrying one of the N audio signals as extension bits appendedto said predetermined multichannel audio format; reading out the bits inthe predetermined multichannel audio format while ignoring the extensionbits; decoding the bits in the predetermined multichannel audio formatto reproduce said M discrete audio channels; applying the M discreteaudio channels to respective discrete speaker channels to reproduce Mdiscrete audio signals and at least one phantom audio signal.
 34. Themethod of claim 33, wherein said M discrete audio channels are left andright surround-sound channels and said N discrete audio signals compriseleft, right and center surround-sound signals, said discrete extensionchannel carrying said center surround-sound signal, which althoughignored during read out, is reproduced as a phantom center surroundsignal.
 35. The method of claim 33, wherein said discrete audio channelsand discrete extension channel are recorded onto said media as asequence of audio frames that are read out a frame at a time, the sizeof the audio frame recorded on said media being varied fromframe-to-frame to accommodate the discrete extension channels.
 36. Amethod of reproducing multichannel audio, comprising: receiving a mediahaving a recorded bitstream, said bitstream including M discrete audiochannels that carry N mixed audio signals where N>M in a predeterminedmultichannel audio format and at least one discrete extension channel,each one carrying one of the N audio signals as extension bits appendedto said predetermined multichannel audio format; reading out the bits inthe predetermined multichannel audio format while ignoring the extensionbits; decoding the bits in the predetermined multichannel audio formatto reproduce said M discrete audio channels; matrix decoding the M audiochannels to provide N discrete audio channels that carry matrix decodedaudio representations of the N mixed audio signals; and applying the Ndiscrete audio channels to respective discrete speakers channels toreproduce matrix decoded multichannel audio.
 37. The method of claim 36,wherein said M discrete audio channels are left and right surround-soundchannels and said N discrete audio signals comprise left, right andcenter surround-sound signals, said discrete extension channel carryingsaid center surround-sound signal, which because it is ignored duringread out, is reproduced as matrix decoded audio.
 38. The method of claim36, wherein said discrete audio channels and discrete extension channelare recorded onto said media as a sequence of audio frames that are readout a frame at a time, the size of the audio frame recorded on saidmedia being varied from frame-to-frame to accommodate the discreteextension channels.